Search CORE

61 research outputs found

Unsupervised grammar induction of clinical report sublanguage

Author: Rohit J Kate
Publication venue: Springer Nature
Publication date: 01/01/2012

BACKGROUND: Clinical reports are written using a subset of natural language while employing many domain-specific terms; such a language is also known as a sublanguage for a scientific or a technical domain. Different genres of clinical reports use different sublanguages, and in addition, different medical facilities use different medical language conventions. This makes supervised training of a parser for clinical sentences very difficult as it would require expensive annotation effort to adapt to every type of clinical text. METHODS: In this paper, we present an unsupervised method which automatically induces a grammar and a parser for the sublanguage of a given genre of clinical reports from a corpus with no annotations. In order to capture sentence structures specific to clinical domains, the grammar is induced in terms of semantic classes of clinical terms in addition to part-of-speech tags. Our method induces grammar by minimizing the combined encoding cost of the grammar and the corresponding sentence derivations. The probabilities for the productions of the induced grammar are then learned from the unannotated corpus using an instance of the expectation-maximization algorithm. RESULTS: Our experiments show that the induced grammar is able to parse novel sentences. Using a dataset of discharge summary sentences with no annotations, our method obtains 60.5% F-measure for parse-bracketing on sentences of maximum length 10. By varying a parameter, the method can induce a range of grammars, from very specific to very general, and obtains the best performance in between the two extremes.

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Clinical Term Normalization Using Learned Edit Patterns and Subconcept Matching: System Development and Evaluation

Author: Kate Rohit J.
Publication venue: UWM Digital Commons
Publication date: 14/01/2021

Background: Clinical terms mentioned in clinical text are often not in their standardized forms as listed in clinical terminologies because of linguistic and stylistic variations. However, many automated downstream applications require clinical terms mapped to their corresponding concepts in clinical terminologies, thus necessitating the task of clinical term normalization. Objective: In this paper, a system for clinical term normalization is presented that utilizes edit patterns to convert clinical terms into their normalized forms. Methods: The edit patterns are automatically learned from the Unified Medical Language System (UMLS) Metathesaurus as well as from the given training data. The edit patterns are generalized sequences of edits that are derived from edit distance computations. The edit patterns are both character based as well as word based and are learned separately for different semantic types. In addition to these edit patterns, the system also normalizes clinical terms through the subconcepts mentioned within them. Results: The system was evaluated as part of the 2019 n2c2 Track 3 shared task of clinical term normalization. It obtained 80.79% accuracy on the standard test data. This paper includes ablation studies to evaluate the contributions of different components of the system. A challenging part of the task was disambiguation when a clinical term could be normalized to multiple concepts. Conclusions: The learned edit patterns led the system to perform well on the normalization task. Given that the system is based on patterns, it is human interpretable and is also capable of giving insights about common variations of clinical terms mentioned in clinical text that are different from their standardized forms
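The abstract above says the edit patterns are derived from edit distance computations, at both the character and the word level. As a rough illustration only, the sketch below computes a plain Levenshtein distance and backtraces the edit operations. It is not the system described in the paper: the function name `edit_script` and the operation labels are hypothetical, and the generalization of edits into patterns is not shown.

```python
def edit_script(source, target):
    """Return (distance, ops) that turn `source` into `target`.

    Works on any sequences: strings give character-based edits,
    lists of words give word-based edits. Standard Levenshtein DP,
    used here only as an illustrative stand-in.
    """
    m, n = len(source), len(target)
    # dist[i][j] = edit distance between source[:i] and target[:j]
    dist = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        dist[i][0] = i
    for j in range(n + 1):
        dist[0][j] = j
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if source[i - 1] == target[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,         # delete from source
                dist[i][j - 1] + 1,         # insert into source
                dist[i - 1][j - 1] + cost,  # match or substitute
            )

    # Backtrace from the bottom-right cell to recover the operations.
    ops = []
    i, j = m, n
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and source[i - 1] == target[j - 1]
                and dist[i][j] == dist[i - 1][j - 1]):
            i, j = i - 1, j - 1  # items match, nothing to record
        elif i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + 1:
            ops.append(("substitute", source[i - 1], target[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            ops.append(("delete", source[i - 1], ""))
            i -= 1
        else:
            ops.append(("insert", "", target[j - 1]))
            j -= 1
    ops.reverse()
    return dist[m][n], ops


# Character-based and word-based use of the same routine.
char_distance, char_ops = edit_script("kitten", "sitting")
word_distance, word_ops = edit_script("heart attack".split(),
                                      "heart attacks".split())
```

Passing a string yields character-level edits and passing a token list yields word-level edits, which mirrors the two granularities the abstract mentions without claiming anything about how the paper generalizes them.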

University of Wisconsin-Milwaukee

Learning for clinical named entity recognition without manual annotations

Author: Ghiasvand Omid
Kate Rohit J.
Publication venue: UWM Digital Commons
Publication date: 30/10/2018

Background: Named entity recognition (NER) systems are commonly built using supervised methods that use machine learning to learn from corpora manually annotated with named entities. However, manually annotating corpora is very expensive and laborious. Materials and methods: In this paper, a novel method is presented for training clinical NER systems that does not require any manual annotations. It only requires a raw text corpus and a resource like UMLS that can give a list of named entities along with their semantic types. Using these two resources, annotations are automatically obtained to train machine learning methods. The method was evaluated on the NER shared-task datasets of i2b2 2010 and SemEval 2014. Results: On the SemEval 2014 dataset for recognizing diseases and disorders, the method obtained F-measure of 0.693 for exact matching and of 0.773 allowing overlaps. This is comparable to many supervised systems in the past that had used manual annotations for training. On the i2b2 2010 dataset for recognizing problems, tests and treatments, the method obtained F-measures of 0.451, 0.338 and 0.204 respectively for exact matching and of 0.721, 0.587 and 0.475 respectively allowing overlaps. These results are better than an existing unsupervised method. Conclusions: Experiments on standard datasets showed that the new method performed well. The method is general and could be applied to recognize entities of other types on other genres of text without needing manual annotations
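The abstract above describes obtaining training annotations automatically from raw text plus a term list with semantic types. The sketch below shows one simple way this could look, assuming greedy longest-match lookup and BIO labels; both choices are assumptions made for illustration and are not stated in the abstract. The `lexicon` dictionary is a hypothetical stand-in for a resource such as UMLS, and `auto_annotate` is an invented name.

```python
def auto_annotate(tokens, lexicon, max_len=4):
    """Label tokens with BIO tags by greedy longest-match lookup.

    `lexicon` maps a lower-cased phrase to a semantic type. It is a
    toy stand-in for a real terminology resource.
    """
    labels = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate phrase first, then shorter ones.
        for span in range(min(max_len, len(tokens) - i), 0, -1):
            phrase = " ".join(tokens[i:i + span]).lower()
            if phrase in lexicon:
                sem_type = lexicon[phrase]
                labels[i] = "B-" + sem_type
                for k in range(i + 1, i + span):
                    labels[k] = "I-" + sem_type
                i += span
                matched = True
                break
        if not matched:
            i += 1
    return labels


# Hypothetical two-entry lexicon and an example sentence.
lexicon = {"chest pain": "problem", "aspirin": "treatment"}
tokens = "Patient denies chest pain after aspirin".split()
labels = auto_annotate(tokens, lexicon)
```

Labels produced this way could then serve as noisy training data for a sequence tagger, which is the general idea the abstract outlines.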

University of Wisconsin-Milwaukee

Genie: A Generator of Natural Language Semantic Parsers for Virtual Assistant Commands

Author: Alvarez-Melis David
Banarescu Laura
Chen David L
Chu Shumo
Ganitkevitch Juri
Kate Rohit J
Kingma Diederik P
Pasupat Panupong
Quirk Chris
Shetty Jitesh
Steedman Mark
Trakhtenbrot Boris A.
Wang Yushi
Wong Yuk Wah
Xu Xiaojun
Zelle John M
Zettlemoyer Luke S
Publication venue: Association for Computing Machinery (ACM)
Publication date: 18/04/2019

To understand diverse natural language commands, virtual assistants today are trained with numerous labor-intensive, manually annotated sentences. This paper presents a methodology and the Genie toolkit that can handle new compound commands with significantly less manual effort. We advocate formalizing the capability of virtual assistants with a Virtual Assistant Programming Language (VAPL) and using a neural semantic parser to translate natural language into VAPL code. Genie needs only a small realistic set of input sentences for validating the neural model. Developers write templates to synthesize data; Genie uses crowdsourced paraphrases and data augmentation, along with the synthesized data, to train a semantic parser. We also propose design principles that make VAPL languages amenable to natural language translation. We apply these principles to revise ThingTalk, the language used by the Almond virtual assistant. We use Genie to build the first semantic parser that can support compound virtual assistant commands with unquoted free-form parameters. Genie achieves a 62% accuracy on realistic user inputs. We demonstrate Genie's generality by showing a 19% and 31% improvement over the previous state of the art on a music skill, aggregate functions, and access control. Comment: To appear in PLDI 201

arXiv.org e-Print Archive

Crossref

NEUROlogical Prognosis After Cardiac Arrest in Kids (NEUROPACK) study: protocol for a prospective multicentre clinical prediction model derivation and validation study in children after cardiac arrest

Introduction Currently, we are unable to accurately predict mortality or neurological morbidity following resuscitation after paediatric out-of-hospital (OHCA) or in-hospital (IHCA) cardiac arrest. A clinical prediction model may improve communication with parents and families and risk stratification of patients for appropriate postcardiac arrest care. This study aims to derive and validate a clinical prediction model to predict, within 1 hour of admission to the paediatric intensive care unit (PICU), neurodevelopmental outcome at 3 months after paediatric cardiac arrest. Methods and analysis A prospective study of children (age: >24 hours and <16 years), admitted to 1 of the 24 participating PICUs in the UK and Ireland, following an OHCA or IHCA. Patients are included if requiring more than 1 min of cardiopulmonary resuscitation and mechanical ventilation at PICU admission. Children who had cardiac arrests in PICU or neonatal intensive care unit will be excluded. Candidate variables will be identified from data submitted to the Paediatric Intensive Care Audit Network registry. Primary outcome is neurodevelopmental status, assessed at 3 months by telephone interview using the Vineland Adaptive Behavioural Score II questionnaire. A clinical prediction model will be derived using logistic regression with model performance and accuracy assessment. External validation will be performed using the Therapeutic Hypothermia After Paediatric Cardiac Arrest trial dataset. We aim to identify 370 patients, with successful consent and follow-up of 150 patients. Patient inclusion started 1 January 2018 and inclusion will continue over 18 months. Ethics and dissemination Ethical review of this protocol was completed by 27 September 2017 at the Wales Research Ethics Committee 5, 17/WA/0306. The results of this study will be published in peer-reviewed journals and presented in conferences. Trial registration number NCT03574025

University of Birmingham Research Portal

Directory of Open Access Journals

White Rose Research Online

Chromosomal microarray testing in adults with intellectual disability presenting with comorbid psychiatric disorders.

Author: A Castillo
Andrew McQuillin
André Strydom
Angela Hassiotis
Caroline Ogilvie
Christine Patch
Deborah Morrogh
Dimitrios Paschos
DT Miller
E Hladilkova
E Palmer
E Rees
F Degenhardt
Frances Flinter
G Costain
G Giaroli
G Kirov
GM Cooper
I Freunscht
J De Villiers
J Iyer
J Rojahn
Jane McCarthy
Jennifer Carter
Johan H Thygesen
K Baker
K Baker
Kate Wolfe
LA Weiss
M Kharbanda
M Lingen
M Viñas-Jornet
MI Srebniak
Mo Eyeoyibo
N Huang
Nagarajan Perumal
Nick Bass
NJ Bass
O Palumbo
Peter Cutajar
Raja Mukherjee
Rohit Shankar
S Garg
S Moss
S-A Cooper
Saif Sharif
Stephen Read
Suchithra Thirulokachandran
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

Chromosomal copy-number variations (CNVs) are a class of genetic variants highly implicated in the aetiology of neurodevelopmental disorders, including intellectual disabilities (ID), schizophrenia and autism spectrum disorders (ASD). Yet the majority of adults with idiopathic ID presenting to psychiatric services have not been tested for CNVs. We undertook genome-wide chromosomal microarray analysis (CMA) of 202 adults with idiopathic ID recruited from community and in-patient ID psychiatry services across England. CNV pathogenicity was assessed using standard clinical diagnostic methods and participants underwent comprehensive medical and psychiatric phenotyping. We found an 11% yield of likely pathogenic CNVs (22/202). CNVs at recurrent loci, including the 15q11-q13 and 16p11.2-p13.11 regions, were most frequently observed. We observed an increased frequency of 16p11.2 duplications compared with those reported in single-disorder cohorts. CNVs were also identified in genes known to affect neurodevelopment, namely NRXN1 and GRIN2B. Furthermore, deletions at 2q13, 12q21.2-21.31 and 19q13.32, and duplications at 4p16.3, 13q32.3-33.3 and Xq24-25 were observed. Routine CMA in ID psychiatry could uncover ~11% new genetic diagnoses with potential implications for patient management. We advocate greater consideration of CMA in the assessment of adults with idiopathic ID presenting to psychiatry services.

Crossref

UCL Discovery

PubMed Central

Plymouth Electronic Archive and Research Library

King's Research Portal

Learning an Executable Neural Semantic Parser

Author: Andreas Jacob
Artzi Yoav
Bahdanau Dzmitry
Baker Collin F.
Banarescu Laura
Berant Jonathan
Berant Jonathan
Cai Qingqing
Chen David L.
Clarke James
Jianpeng Cheng
Kate Rohit J.
Kim Yoon
Krishnamurthy Jayant
Kwiatkowksi Tom
Kwiatkowski Tom
Kwiatkowski Tom
Lafferty John
Liang Percy
Matuszek Cynthia
Mirella Lapata
Neelakantan Arvind
Neelakantan Arvind
Reed Scott
Siva Reddy
Steedman Mark
Sutskever Ilya
Sutskever Ilya
Vaswani Ashish
Vijay Saraswat
Zelle John M.
Zettlemoyer Luke
Zettlemoyer Luke S.
Zhong Victor
Publication venue: MIT Press - Journals
Publication date: 12/08/2018

This paper describes a neural semantic parser that maps natural language utterances onto logical forms which can be executed against a task-specific environment, such as a knowledge base or a database, to produce a response. The parser generates tree-structured logical forms with a transition-based approach which combines a generic tree-generation algorithm with domain-general operations defined by the logical language. The generation process is modeled by structured recurrent neural networks, which provide a rich encoding of the sentential context and generation history for making predictions. To tackle mismatches between natural language and logical form tokens, various attention mechanisms are explored. Finally, we consider different training settings for the neural semantic parser, including a fully supervised training where annotated logical forms are given, weakly-supervised training where denotations are provided, and distant supervision where only unlabeled sentences and a knowledge base are available. Experiments across a wide range of datasets demonstrate the effectiveness of our parser. Comment: In Journal of Computational Linguistic

arXiv.org e-Print Archive

Crossref

Edinburgh Research Explorer

Phenotypic Characterization of EIF2AK4 Mutation Carriers in a Large Cohort of Patients Diagnosed Clinically With Pulmonary Arterial Hypertension.

Author: Charaka Hadinnapola
Marta Bleda
Matthias Haimel
Nicholas Screaton
Andrew Swift
Peter Dorfmüller
Stephen D. Preston
Mark Southwood
Jules Hernandez-Sanchez
Jennifer Martin
Carmen Treacy
Katherine Yates
Harm Bogaard
Colin Church
Gerry Coghlan
Robin Condliffe
Paul A. Corris
Simon Gibbs
Barbara Girerd
Simon Holden
Marc Humbert
David G. Kiely
Allan Lawrie
Rajiv Machado
Robert MacKenzie Ross
Shahin Moledina
David Montani
Michael Newnham
Andrew Peacock
Joanna Pepke-Zaba
Paula Rayner-Matthews
Olga Shamardina
Florent Soubrier
Laura Southgate
Jay Suntharalingam
Mark Toshner
Richard Trembath
Anton Vonk Noordegraaf
Martin R. Wilkins
Stephen J. Wort
John Wharton
Stefan Gräf
Nicholas W. Morrell
Timothy Aitman
David Bennett
Mark Caulfield
Patrick Chinnery
Daniel Gale
Ania Koziell
Taco W Kuijpers
Michael A Laffan
Eamonn Maher
Hugh S Markus
Willem H Ouwehand
David Perry
F Lucy Raymond
Irene Roberts
Kenneth Smith
Adrian Thrasher
Hugh Watkins
Catherine Williamson
Geoffrey Woods
Sofie Ashford
John R Bradley
Debra Fletcher
Tracey Hammerton
Roger James
Nathalie Kingston
Willem H Ouwehand
Christopher J Penkett
F Lucy Raymond
Kathleen Stirrups
Marijke Veltman
Tim Young
Sofie Ashford
Matthew Brown
Naomi Clements-Brod
John Davis
Eleanor Dewhurst
Marie Erwood
Amy Frary
Rachel Linger
Sofia Papadia
Karola Rehnstrom
Hannah Stark
David Allsup
Steve Austin
Tamam Bakchoul
Tadbir K Bariana
Paula Bolton-Maggs
Elizabeth Chalmers
Peter Collins
Wendy N Erber
Tamara Everington
Remi Favier
Kathleen Freson
Bruce Furie
Michael Gattens
Keith Gomez
Daniel Greene
Andreas Greinacher
Daniel Hart
Johan WM Heemskerk
Yvonne Henskens
Rashid Kazmi
David Keeling
Anne M Kelly
Michael A Laffan
Michele P Lambert
Claire Lentaigne
Ri Liesner
Sarah Mangles
Mary Mathias
Carolyn M Millar
Andrew Mumford
Paquita Nurden
Willem H Ouwehand
Sofia Papadia
Jeanette Payne
John Pasi
David J Perry
Kathelijne Peerlinck
Michael Richards
Matthew Rondina
Catherine Roughley
Sol Schulman
Harald Schulze
Marie Scully
Suthesh Sivapalaratnam
R Campbell Tait
Kate Talks
Jecko Thachil
Ernest Turro
Cheng-Hock Toh
Chris Van Geet
Minka De Vries
Timothy Q Warner
Sarah Westbury
Abigail Furnell
Rutendo Mapeta
Ilenia Simeoni
Simon Staines
Jonathan Stephens
Kathleen Stirrups
Deborah Whitehorn
Christopher Watt
Antony Attwood
Louise Daugherty
Sri VV Deevi
Csaba Halmagyi
Fengyuan Hu
Roger James
Vera Matser
Stuart Meacham
Karyn Megy
Christopher J Penkett
Kathleen Stirrups
Catherine Titterton
Salih Tuna
Ping Yu
Julie von Ziegenweldt
William Astle
Keren Carss
Daniel Greene
Hana Lango-Allen
Ernest Turro
William Astle
Daniel Greene
Sylvia Richardson
Ernest Turro
Paul Calleja
Stuart Rankin
Wojciech Turek
Christine Bryson
Julie Anderson
Debra Fletcher
Coleen McJannet
Sophie Stock
Tim Young
Evangeline Wassmer
Aman Sohal
Saikat Santra
Julie Vogt
Manali Chitre
Deepa Krishnakumar
Gautum Ambegaonkar
Anna Maw
Ruth Armstrong
Soo-Mi Park
Sarju Mehta
Joan Paterson
Jenny Carmichael
Louise Allen
Anke Hensiek
Helen Firth
Penelope Stein
Patrick Deegan
Rainer Doffinger
Alasdair Parker
Maria Bitner-Glindzicz
Richard Scott
Jane Hurst
Elisabeth Rosser
Melissa Lees
Emma Clement
Robert Henderson
Dorothy Thompson
Alice Gardham
Paul Gissen
Dragana Josifova
Ellen Thomas
Chris Patch
Charu Deshpande
Frances Flinter
Muriel Holder
Natalie Canham
Emma Wakeling
Susan Holder
Neeti Ghali
Angie Brady
Virginia Clowes
Robert MacLaren
Andrew Webster
Anthony Moore
Gavin Arno
Michel Michaelides
Julia Rankin
Manju Kurian
Elaine Murphy
Keren Carss
Alba Sanchis-Juan
Marie Erwood
Eleanor Dewhurst
Detelina Grozeva
F Lucy Raymond
Evan Reid
Geoff Woods
Marc Tischkowitz
Richard Sandford
Sonia Ali
Amanda Creaser-Myers
Victoria Cookson
Rosa DaCosta
Natalie Dormand
Pavandeep K Ghataorhe
Alan Greenhalgh
Anna Huis in’t Veld
Fiona Kennedy
Rob Mackenzie Ross
Larahmie Masati
Sharon Meehan
Shokri Othman
Val Pollock
Gary Polwarth
Christopher J Rhodes
Kevin Rue-Albrecht
Gwen Schotte
Debbie Shipley
Yvonne Tan
Ivy Wanjiku
John Wort
Kenneth Smith
Taco Kuijpers
Adrian Thrasher
James Thaventhiran
Matthew Brown
Hana Lango Allen
Ilenia Simeoni
Emily Staples
Crina Samarghitean
Hana Alachkar
Richard Antrobus
Gururaj Arumugakani
Chiara Bacchelli
Helen Baxendale
Claire Bethune
Shahnaz Bibi
Claire Booth
Michael Browning
Siobhan Burns
Anita Chandra
Nichola Cooper
Sophie Davies
Lisa Devlin
Rainer Doffinger
Elizabeth Drewe
David Edgar
William Egner
Rohit Ghurye
Kimberley Gilmour
Sarah Goddard
Pavel Gordins
Sofia Grigoriadou
Scott Hackett
Rosie Hague
Grant Hayman
Archana Herwadkar
Aarnoud Huissoon
Stephen Jolles
Peter Kelleher
Dinakantha Kumararatne
Sara Lear
Hilary Longhurst
Lorena Lorenzo
Jesmeen Maimaris
Ania Manson
Elizabeth McDermott
Sai Murng
Sergey Nejentsev
Sadia Noorani
Eric Oksenhendler
Mark Ponsford
Waseem Qasim
Isabella Quinti
Alex Richter
Ravishankar Sargur
Sinisa Savic
Suranjith Seneviratne
Carrock Sewell
Hans Stauss
Moira Thomas
Steve Welch
Lisa Willcocks
Nigel Yeatman
Patrick Yong
Publication venue: American Heart Association
Publication date: 01/01/2017

BACKGROUND: Pulmonary arterial hypertension (PAH) is a rare disease with an emerging genetic basis. Heterozygous mutations in the gene encoding the bone morphogenetic protein receptor type 2 (BMPR2) are the commonest genetic cause of PAH, whereas biallelic mutations in the eukaryotic translation initiation factor 2 alpha kinase 4 gene (EIF2AK4) are described in pulmonary veno-occlusive disease/pulmonary capillary hemangiomatosis. Here, we determine the frequency of these mutations and define the genotype-phenotype characteristics in a large cohort of patients diagnosed clinically with PAH. METHODS: Whole-genome sequencing was performed on DNA from patients with idiopathic and heritable PAH and with pulmonary veno-occlusive disease/pulmonary capillary hemangiomatosis recruited to the National Institute of Health Research BioResource-Rare Diseases study. Heterozygous variants in BMPR2 and biallelic EIF2AK4 variants with a minor allele frequency of <1:10 000 in control data sets and predicted to be deleterious (by combined annotation-dependent depletion, PolyPhen-2, and sorting intolerant from tolerant predictions) were identified as potentially causal. Phenotype data from the time of diagnosis were also captured. RESULTS: Eight hundred sixty-four patients with idiopathic or heritable PAH and 16 with pulmonary veno-occlusive disease/pulmonary capillary hemangiomatosis were recruited. Mutations in BMPR2 were identified in 130 patients (14.8%). Biallelic mutations in EIF2AK4 were identified in 5 patients with a clinical diagnosis of pulmonary veno-occlusive disease/pulmonary capillary hemangiomatosis. Furthermore, 9 patients with a clinical diagnosis of PAH carried biallelic EIF2AK4 mutations. These patients had a reduced transfer coefficient for carbon monoxide (Kco; 33% [interquartile range, 30%-35%] predicted) and younger age at diagnosis (29 years; interquartile range, 23-38 years) and more interlobular septal thickening and mediastinal lymphadenopathy on computed tomography of the chest compared with patients with PAH without EIF2AK4 mutations. However, radiological assessment alone could not accurately identify biallelic EIF2AK4 mutation carriers. Patients with PAH with biallelic EIF2AK4 mutations had a shorter survival. CONCLUSIONS: Biallelic EIF2AK4 mutations are found in patients classified clinically as having idiopathic and heritable PAH. These patients cannot be identified reliably by computed tomography, but a low Kco and a young age at diagnosis suggest the underlying molecular diagnosis. Genetic testing can identify these misclassified patients, allowing appropriate management and early referral for lung transplantation.
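The abstract above reports BMPR2 mutations in 130 patients (14.8%) without stating the denominator. A quick check, assuming the percentage is taken over the full recruited cohort (the 864 patients with idiopathic or heritable PAH plus the 16 with pulmonary veno-occlusive disease/pulmonary capillary hemangiomatosis), reproduces the figure. That choice of denominator is an inference from the arithmetic, not something the abstract says.

```python
# Counts quoted in the abstract.
pah_patients = 864       # idiopathic or heritable PAH
pvod_pch_patients = 16   # PVOD/PCH
bmpr2_carriers = 130

# Assumption: the reported percentage uses the whole recruited cohort.
cohort = pah_patients + pvod_pch_patients
bmpr2_rate = round(100 * bmpr2_carriers / cohort, 1)

print(cohort, bmpr2_rate)
```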

Maastricht University Research Portal

Crossref

University of Birmingham Research Portal

Spiral - Imperial College Digital Repository

King's Research Portal

White Rose Research Online

St George's Online Research Archive

Explore Bristol Research

FigShare

Telomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing data.

Author: Afzal Maryam
Aitman Timothy
Alachkar Hana
Alavijeh Omid S.
Ali Sonia
Ali Souad
Allen Louise
Allsup David
Ambegaonkar Gautum
Ancliff Phil
Anderson Julie
Antrobus Richard
Armstrong Ruth
Arno Gavin
Arumugakani Gururaj
Ashford Sofie
Astle William
Attwood Antony
Austin Steve
Babbs Christian
Bacchelli Chiara
Bakchoul Tamam
Bariana Tadbir K.
Baxendale Helen
Bennett David
Bethune Claire
Bibi Shahnaz
Bitner-Glindzicz Maria
Bleda Marta
Boggard Harm J.
Bolton-Maggs Paula
Booth Claire
Bradley John R.
Brady Angie
Brown Matthew
Browning Michael
Bryson Christine
Burns Siobhan
Calleja Paul
Canham Natalie
Carmichael Jenny
Carss Keren
Caulfield Mark
Chalmers Elizabeth
Chandra Anita
Chinnery Patrick
Chitre Manali
Chong Sam
Church Colin
Clement Emma
Clements-Brod Naomi
Clowes Virginia
Coghlan Gerry
Colby Elizabeth
Collins Janine
Collins Peter
Cook H. Terry
Cookson Victoria
Cooper Nichola
Corris Paul A.
Creaser-Myers Amanda
DaCosta Rosa
Daugherty Louise
Davies Sophie
Davis John
De Vries Minka
Deegan Patrick
Deevi Sri V. V.
Deshpande Charu
Devlin Lisa
Dewhurst Eleanor
Dixon Peter
Doffinger Rainer
Dolling Helen
Dormand Natalie
Drewe Elizabeth
Edgar David
Egner William
Emmerson Ingrid
Erber Wendy N.
Erwood Marie
Everington Tamara
Eyries Mélanie
Farmery James H. R.
Favier Remi
Firth Helen
Fletcher Debra
Flinter Frances
Frary Amy
French Courtney
Freson Kathleen
Furie Bruce
Furnell Abigail
Gale Daniel
Gall Henning
Gardham Alice
Gattens Michael
Gebhart Johanna
Geet Chris Van
Ghali Neeti
Ghataorhe Pavandeep K.
Ghio Stefano
Ghofrani Ardi
Ghurye Rohit
Gibbs J. Simon R.
Gilmour Kimberley
Ginsberg Lionel
Girerd Barbara
Gissen Paul
Goddard Sarah
Gomez Keith
Gordins Pavel
Gosal David
Greene Daniel
Greenhalgh Alan
Greinacher Andreas
Gresele Paolo
Grigoriadou Sofia
Grozeva Detelina
Gräf Stefan
Hackett Scott
Hadden Rob
Hadinnapola Charaka
Hague Rosie
Haimel Matthias
Halmagyi Csaba
Hammerton Tracey
Harper Lorraine
Hart Daniel
Hayman Grant
Heemskerk Johan W. M.
Henderson Robert
Hensiek Anke
Henskens Yvonne
Herwadkar Archana
Holden Simon
Holder Muriel
Holder Susan
Horvath Rita
Houweling Arjan C.
Hu Fengyuan
Hudson Gavin
Huissoon Aarnoud
Humbert Marc
Hurst Jane
James Roger
Johnson Sally
Jolles Stephen
Josifova Dragana
Kazmi Rashid
Keeling David
Kelleher Peter
Kelly Anne M.
Kennedy Fiona
Kiely David G.
Kingston Nathalie
Kovacs Gabor
Koziell Ania
Krishnakumar Deepa
Kuijpers Taco
Kuijpers Taco W.
Kumararatne Dinakantha
Kurian Manju
Laffan Michael A.
Lambert Michele P.
Lango Allen Hana
Lango-Allen Hana
Lawrie Allan
Layton Mark
Lear Sara
Lees Melissa
Lentaigne Claire
Levine Adam P.
Liesner Ri
Linger Rachel
Longhurst Hilary
Lorenzo Lorena
Louka Eleni
Lynch Andy G.
Machado Rajiv
MackenzieRoss Rob V.
MacLaren Robert
Mahdi-Rogers Mohamed
Maher Eamonn
Maimaris Jesmeen
Makris Mike
Man Patrick Yu Wai
Mangles Sarah
Manson Ania
Manzur Adnan
Mapeta Rutendo
Markus Hugh S.
Marshall Andrew
Martin Jennifer
Martin Jennifer M.
Masati Larahmie
Mathias Mary
Matser Vera
Matthews Emma
Maw Anna
McCarthy Mark
McDermott Elizabeth
McGowan Simon
McJannet Coleen
Meacham Stuart
Mead Adam
Meehan Sharon
Megy Karyn
Mehta Sarju
Michaelides Michel
Millar Carolyn M.
Moledina Shahin
Montani David
Moore Anthony
Morrell Nicholas W.
Mumford Andrew
Murng Sai
Murphy Elaine
Nejentsev Sergey
Noorani Sadia
Nurden Paquita
Oksenhendler Eric
Ormondroyd Liz
Othman Shokri
Ouwehand Willem H.
Papadia Sofia
Park Soo-Mi
Parker Alasdair
Pasi John
Patch Chris
Paterson Joan
Payne Jeanette
Peacock Andrew J.
Peerlinck Kathelijne
Penkett Christopher J.
Pepke-Zaba Joanna
Perry David J.
Pollock Val
Polwarth Gary
Ponsford Mark
Qasim Waseem
Quinti Isabella
Ranganathan Lavanya
Rankin Julia
Rankin Stuart
Raymond F. Lucy
Rayner-Matthews Paula
Rehnstrom Karola
Reid Evan
Reilly Mary
Renton Tara
Revel-Vilk Shoshana
Rhodes Christopher J.
Rice Andrew
Richards Michael
Richardson Sylvia
Richter Alex
Roberts Irene
Rondina Matthew
Rosser Elisabeth
Roughley Catherine
Roy Noémi
Rue-Albrecht Kevin
Saleem Moin
Samarghitean Crina
Sanchis-Juan Alba
Sandford Richard
Santra Saikat
Sargur Ravishankar
Savic Sinisa
Scelsi Laura
Schotte Gwen
Schulman Sol
Schulze Harald
Scott Richard
Scully Marie
Seneviratne Suranjith
Sewell Carrock
Shamardina Olga
Shipley Debbie
Simeoni Ilenia
Sivapalaratnam Suthesh
Smith Kenneth G. C.
Smith Mike L.
Sohal Aman
Soubrier Florent
Southgate Laura
Staines Simon
Staples Emily
Stark Hannah
Stauss Hans
Stein Penelope
Stephens Jonathan
Stirrups Kathleen
Stock Sophie
Stubbs Matthew
Suntharalingam Jay
Tait R. Campbell
Talks Kate
Tan Rhea
Tan Yvonne
Thachil Jecko
Thaventhiran James
Themistocleous Andreas
Thomas Ellen
Thomas Moira
Thompson Dorothy
Thrasher Adrian
Tischkowitz Marc
Titterton Catherine
Toh Cheng-Hock
Toshner Mark
Treacy Carmen M.
Trembath Richard
Tuna Salih
Turek Wojciech
Turro Ernest
Vale Tom
Veld Anna Huis in’t
Veltman Marijke
Vogt Julie
von Ziegenweldt Julie
Vonk Noordegraaf Anton
Wakeling Emma
Walker Sara
Walker Suellen
Wanjiku Ivy
Warner Timothy Q.
Wassmer Evangeline
Watkins Hugh
Watson Henry
Watt Christopher
Webster Andrew
Wei Wei
Welch Steve
Westbury Sarah
Wharton John
Whitehorn Deborah
Whitworth James
Wilkins Martin
Willcocks Lisa
Williamson Catherine
Wong Edwin K. S.
Woods Geoff
Wort Stephen J.
Yates Katherine
Yeatman Nigel
Yong Patrick
Young Tim
Yu Ping
Zuydam Natalie Van
Publication venue: Sci Rep
Publication date: 01/01/2018

Telomere length is a risk factor in disease and the dynamics of telomere length are crucial to our understanding of cell replication and vitality. The proliferation of whole genome sequencing represents an unprecedented opportunity to glean new insights into telomere biology on a previously unimaginable scale. To this end, a number of approaches for estimating telomere length from whole-genome sequencing data have been proposed. Here we present Telomerecat, a novel approach to the estimation of telomere length. Previous methods have been dependent on the number of telomeres present in a cell being known, which may be problematic when analysing aneuploid cancer data and non-human samples. Telomerecat is designed to be agnostic to the number of telomeres present, making it suited for the purpose of estimating telomere length in cancer studies. Telomerecat also accounts for interstitial telomeric reads and presents a novel approach to dealing with sequencing errors. We show that Telomerecat performs well at telomere length estimation when compared to leading experimental and computational methods. Furthermore, we show that it detects expected patterns in longitudinal data, repeated measurements, and cross-species comparisons. We also apply the method to cancer cell data, uncovering an interesting relationship with the underlying telomerase genotype.

University of Liverpool Repository

Directory of Open Access Journals

White Rose Research Online

Maastricht University Research Portal

Harvard University - DASH

Oxford University Research Archive

Apollo (Cambridge)

King's Research Portal

St George's Online Research Archive

Explore Bristol Research

St Andrews Research Repository